A Comparative Study of Estimation by Analogy using Data Mining Techniques

نویسندگان

  • Geeta Nagpal
  • Moin Uddin
  • Arvinder Kaur
چکیده

Software Estimations provide an inclusive set of directives for software project developers, project managers, and the management in order to produce more realistic estimates based on deficient, uncertain, and noisy data. A range of estimation models are being explored in the industry, as well as in academia, for research purposes but choosing the best model is quite intricate. Estimation by Analogy (EbA) is a form of case based reasoning, which uses fuzzy logic, grey system theory or machine-learning techniques, etc. for optimization. This research compares the estimation accuracy of some conventional data mining models with a hybrid model. Different data mining models are under consideration, including linear regression models like the ordinary least square and ridge regression, and nonlinear models like neural networks, support vector machines, and multivariate adaptive regression splines, etc. A precise and comprehensible predictive model based on the integration of GRA and regression has been introduced and compared. Empirical results have shown that regression when used with GRA gives outstanding results; indicating that the methodology has great potential and can be used as a candidate approach for software effort estimation. Keywords—Software Estimations, Estimation by Analogy, Grey Relational Analysis, Robust Regression, Data Mining Techniques

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Estimation of Punching Shear Capacity of Concrete Slabs Using Data Mining Techniques

Punching shear capacity is a key factor for governing the collapsed form of slabs. This fragile failure that occurs at the slab-column connection is called punching shear failure and has been of concern for the engineers. The most common practice in evaluating the punching strength of the concrete slabs is to use the empirical expressions available in different building design codes. The estima...

متن کامل

Presented a method for estimating the cost of software using PCA to reduce the size and with the help of data mining

  These days, data mining one of the most significant issues. One field data mining is a mixture of computer science and statistics which is considerably limited due to increase in digital data and growth of computational power of computer. One of the domains of data mining is the software cost estimation category. In this article, classifying techniques of learning algorithm of machine ...

متن کامل

Estimation of geochemical elements using a hybrid neural network-Gustafson-Kessel algorithm

Bearing in mind that lack of data is a common problem in the study of porphyry copper mining exploration, our goal was set to identify the hidden patterns within the data and to extend the information to the data-less areas. To do this, the combination of pattern recognition techniques has been used. In this work, multi-layer neural network was used to estimate the concentration of geochemical ...

متن کامل

Limestone chemical components estimation using image processing and pattern recognition techniques

In this study based on image analysis, an ore grade estimation model was developed. The study was performed at a limestone mine in central Iran. The samples were collected from different parts of the mine and crushed in size from 2.58 cm down to 15 cm. The images of the samples were taken in appropriate environment and processed. A total of 76 features were extracted from the identified rock sa...

متن کامل

A Comparative Study between a Pseudo-Forward Equation (PFE) and Intelligence Methods for the Characterization of the North Sea Reservoir

This paper presents a comparative study between three versions of adaptive neuro-fuzzy inference system (ANFIS) algorithms and a pseudo-forward equation (PFE) to characterize the North Sea reservoir (F3 block) based on seismic data. According to the statistical studies, four attributes (energy, envelope, spectral decomposition and similarity) are known to be useful as fundamental attributes in ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • JIPS

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2012